RAW SEQUENCE LISTING
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

. Application Serial Number: 

. Source: OIPE

Date Processed by STIC: 10/10/01


THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:


Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 


Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker 


Raw Sequence Listing Error Summary 


ERROR DETECTED / SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1 Wrapped Nucleics / Wrapped Aminos: The number/text at the end of each line "wrapped" down to the next line. This may occur if your file was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will prevent "wrapping."

2 Invalid Line Length: The rules require that a line not exceed 72 characters in length. This includes white spaces.


3 Misaligned Amino Numbering: The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers; use space characters instead.

4 Non-ASCII: The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please ensure your subsequent submission is saved in ASCII text.

5 Variable Length: Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, each n or Xaa can only represent a single residue. Please present the maximum number of each residue having variable length and indicate in the <220>-<223> section that some may be missing.

6 PatentIn 2.0 "bug": A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid sequence(s) ____. Normally, PatentIn would automatically generate this section from the previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for Artificial or Unknown sequences.


7 Skipped Sequences (OLD RULES): Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
This sequence is intentionally skipped

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequences.


8 Skipped Sequences (NEW RULES): Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence.

<210> sequence id number
<400> sequence id number
000

9 Use of n's or Xaa's (NEW RULES): Use of n's and/or Xaa's has been detected in the Sequence Listing. Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. In the <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 Invalid <213> Response: Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or scientific name (Genus/species). The <220>-<223> section is required when the <213> response is Unknown or is Artificial Sequence.


11 ✓ Use of <220>: Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses. Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or "Unknown." Please explain source of genetic material in <220> to <223> section. (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12 PatentIn 2.0 "bug": Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.
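
Several of the rules in the error summary above are mechanical enough to pre-check before submission. The following is a minimal sketch in Python, not the USPTO Checker program: the function name `check_listing`, the message wording, and the way records are split at `<210>` are all illustrative assumptions. It covers only the 72-character limit, the ASCII and tab rules, and the requirement that `<220>`-`<223>` be present when `<213>` is "Artificial Sequence" or "Unknown" or when n or Xaa is used.

```python
import re

MAX_LINE_LENGTH = 72  # limit quoted in the error summary (white space counts)
NEEDS_FEATURE = {"Artificial Sequence", "Unknown"}


def check_listing(text):
    """Return human-readable problems found in a new-rules style listing.

    Hypothetical helper for illustration; not part of any USPTO tool.
    """
    problems = []

    # Line-level rules: length, ASCII-only content, no tab characters.
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append(f"line {number}: longer than {MAX_LINE_LENGTH} characters")
        if not line.isascii():
            problems.append(f"line {number}: non-ASCII character")
        if "\t" in line:
            problems.append(f"line {number}: tab character, use spaces")

    # Sequence-level rules: split the text into records that start at <210>.
    for record in re.split(r"(?m)^(?=<210>)", text):
        header = re.match(r"<210>\s*(\d+)", record)
        if header is None:
            continue  # general information before the first sequence
        seq_id = header.group(1)
        organism = re.search(r"(?m)^<213>[ \t]*(.*?)[ \t]*$", record)
        has_feature = "<220>" in record and "<223>" in record
        body = record.split("<400>", 1)[1] if "<400>" in record else ""
        # "Xaa" marks an unknown amino acid; "n" inside a lowercase
        # nucleotide group marks an unknown base.
        uses_unknown = any(
            token == "Xaa" or re.fullmatch(r"[acgtu]*n[acgtun]*", token)
            for token in body.split()
        )
        if organism and organism.group(1) in NEEDS_FEATURE and not has_feature:
            problems.append(
                f"SEQ ID {seq_id}: <213> {organism.group(1)} without <220>-<223>"
            )
        if uses_unknown and not has_feature:
            problems.append(f"SEQ ID {seq_id}: n or Xaa without <220>-<223>")
    return problems
```

The sketch only tests that the `<220>` and `<223>` identifiers are present; it does not judge whether the explanation given there is adequate, which remains a matter for the Sequence Rules.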
OIPE


RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/713,601 


DATE: 10/10/2001 
TIME: 14:08:25 


Input Set : A:\04905.ST25.txt 

Output Set: N:\CRF3\10102001\I713601.raw 


3 <110> APPLICANT: Agarwal, Poonam
4 Aizenstein, Brian
5 Arco, David
6 Atiles, Myrta
7 Burris, Deborah
8 de Arruda Indig, Monika
9 Law, Scott
10 Mast, Andrea
11 Marshall, David
12 Miller, Carolyn
13 Oldenberg, Mary
14 Rasmussen, Eric
15 Schneiders, Jennifer

Does Not Comply
Corrected Diskette Needed





17 <120> TITLE OF INVENTION: Methods and Compositions for Detecting Target Sequences 

19 <130> FILE REFERENCE: FORS-04905 

21 <140> CURRENT APPLICATION NUMBER: 09/713,601

C--> 22 <141> CURRENT FILING DATE: 2001-09-24 

24 <150> PRIOR APPLICATION NUMBER: 09/350,309 

25 <151> PRIOR FILING DATE: 1999-07-09 

27 <150> PRIOR APPLICATION NUMBER: 09/381,212 

28 <151> PRIOR FILING DATE: 2000-02-08 

30 <150> PRIOR APPLICATION NUMBER: 08/823,516 

31 <151> PRIOR FILING DATE: 1997-03-24 

33 <150> PRIOR APPLICATION NUMBER: 08/759,038 

34 <151> PRIOR FILING DATE: 1996-12-02

36 <150> PRIOR APPLICATION NUMBER: 08/756,386 

37 <151> PRIOR FILING DATE: 1996-11-26 

39 <150> PRIOR APPLICATION NUMBER: 08/682,853 

40 <151> PRIOR FILING DATE: 1996-07-12 

42 <150> PRIOR APPLICATION NUMBER: 08/599,491

43 <151> PRIOR FILING DATE: 1996-01-24
45 <160> NUMBER OF SEQ ID NOS: 253

47 <170> SOFTWARE: PatentIn version 3.0

49 <210> SEQ ID NO: 1 

50 <211> LENGTH: 2506 

51 <212> TYPE: DNA 

52 <213> ORGANISM: Artificial Sequence 

54 <220> FEATURE: 

55 <223> OTHER INFORMATION: Synthetic 

57 <400> SEQUENCE: 1 

58 atgaggggga tgctgcccct ctttgagccc aagggccggg tcctcctggt ggacggccac 60 
60 cacctggcct accgcacctt ccacgccctg aagggcctca ccaccagccg gggggagccg 120 
62 gtgcaggcgg tctacggctt cgccaagagc ctcctcaagg ccctcaagga ggacggggac 180 
64 gcggtgatcg tggtctttga cgccaaggcc ccctccttcc gccacgaggc ctacgggggg 240 
66 tacaaggcgg gccgggcccc cacgccggag gactttcccc ggcaactcgc cctcatcaag 300 
68 gagctggtgg acctcctggg gctggcgcgc ctcgaggtcc cgggctacga ggcggacgac 360 
70 gtcctggcca gcctggccaa gaaggcggaa aaggagggct acgaggtccg catcctcacc 420


file://C:\Crf3\Outhold\VsrI713601.htm

72 gccgacaaag acctttacca gctcctttcc gaccgcatcc acgtcctcca ccccgagggg 480 

74 tacctcatca ccccggcctg gctttgggaa aagtacggcc tgaggcccga ccagtgggcc 540 

76 gactaccggg ccctgaccgg ggacgagtcc gacaaccttc ccggggtcaa gggcatcggg 600 

78 gagaagacgg cgaggaagct tctggaggag tgggggagcc tggaagccct cctcaagaac 660 

80 ctggaccggc tgaagcccgc catccgggag aagatcctgg cccacatgga cgatctgaag 720 

82 ctctcctggg acctggccaa ggtgcgcacc gacctgcccc tggaggtgga cttcgccaaa 780 

84 aggcgggagc ccgaccggga gaggcttagg gcctttctgg agaggcttga gtttggcagc 840

86 ctcctccacg agttcggcct tctggaaagc cccaaggccc tggaggaggc cccctggccc 900 

88 ccgccggaag gggccttcgt gggctttgtg ctttcccgca aggagcccat gtgggccgat 960 

90 cttctggccc tggccgccgc cagggggggc cgggtccacc gggcccccga gccttataaa 1020 

92 gccctcaggg acctgaagga ggcgcggggg cttctcgcca aagacctgag cgttctggcc 1080 

94 ctgagggaag gccttggcct cccgcccggc gacgacccca tgctcctcgc ctacctcctg 1140 

96 gacccttcca acaccacccc cgagggggtg gcccggcgct acggcgggga gtggacggag 1200 

98 gaggcggggg agcgggccgc cctttccgag aggctcttcg ccaacctgtg ggggaggctt 1260 

100 gagggggagg agaggctcct ttggctttac cgggaggtgg agaggcccct ttccgctgtc 1320 

102 ctggcccaca tggaggccac gggggtgcgc ctggacgtgg cctatctcag ggccttgtcc 1380 

104 ctggaggtgg ccgaggagat cgcccgcctc gaggccgagg tcttccgcct ggccggccac 1440

106 cccttcaacc tcaactcccg ggaccagctg gaaagggtcc tctttgacga gctagggctt 1500 

108 cccgccatcg gcaagacgga gaagaccggc aagcgctcca ccagcgccgc cgtcctggag 1560 

110 gccctccgcg aggcccaccc catcgtggag aagatcctgc agtaccggga gctcaccaag 1620 

112 ctgaagagca cctacattga ccccttgccg gacctcatcc accccaggac gggccgcctc 1680 

114 cacacccgct tcaaccagac ggccacggcc acgggcaggc taagtagctc cgatcccaac 1740 

116 ctccagaaca tccccgtccg caccccgctt gggcagagga tccgccgggc cttcatcgcc 1800 

118 gaggaggggt ggctattggt ggccctggac tatagccaga tagagctcag ggtgctggcc 1860 

120 cacctctccg gcgacgagaa cctgatccgg gtcttccagg aggggcggga catccacacg 1920 

122 gagaccgcca gctggatgtt cggcgtcccc cgggaggccg tggaccccct gatgcgccgg 1980 

124 gcggccaaga ccatcaactt cggggtcctc tacggcatgt cggcccaccg cctctcccag 2040 

126 gagctagcca tcccttacga ggaggcccag gccttcattg agcgctactt tcagagcttc 2100 

128 cccaaggtgc gggcctggat tgagaagacc ctggaggagg gcaggaggcg ggggtacgtg 2160 

130 gagaccctct tcggccgccg ccgctacgtg ccagacctag aggcccgggt gaagagcgtg 2220 

132 cgggaggcgg ccgagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 2280 

134 atgaagctgg ctatggtgaa gctcttcccc aggctggagg aaatgggggc caggatgctc 2340 

136 cttcaggtcc acgacgagct ggtcctcgag gccccaaaag agagggcgga ggccgtggcc 2400 

138 cggctggcca aggaggtcat ggagggggtg tatcccctgg ccgtgcccct ggaggtggag 2460 

140 gtggggatag gggaggactg gctctccgcc aaggagtgat accacc 2506 

143 <210> SEQ ID NO: 2 

144 <211> LENGTH: 2496 

145 <212> TYPE: DNA 

146 <213> ORGANISM: Artificial Sequence 
148 <220> FEATURE:

149 <223> OTHER INFORMATION: Synthetic 

151 <400> SEQUENCE: 2 

152 atggcgatgc ttcccctctt tgagcccaaa ggccgcgtgc tcctggtgga cggccaccac 60 
154 ctggcctacc gcaccttctt tgccctcaag ggcctcacca ccagccgcgg cgaacccgtt 120 
156 caggcggtct acggcttcgc caaaagcctc ctcaaggccc tgaaggagga cggggacgtg 180 
158 gtggtggtgg tctttgacgc caaggccccc tccttccgcc acgaggccta cgaggcctac 240 
160 aaggcgggcc gggcccccac cccggaggac tttccccggc agctggccct catcaaggag 300 
162 ttggtggacc tcctaggcct tgtgcggctg gaggttcccg gctttgaggc ggacgacgtg 360 
164 ctggccaccc tggccaagcg ggcggaaaag gaggggtacg aggtgcgcat cctcactgcc 420 



166 gaccgcgacc tctaccagct cctttcggag cgcatcgcca tcctccaccc tgaggggtac 480 

168 ctgatcaccc cggcgtggct ttacgagaag tacggcctgc gcccggagca gtgggtggac 540 

170 taccgggccc tggcggggga cccctcggat aacatccccg gggtgaaggg catcggggag 600 

172 aagaccgccc agaggctcat ccgcgagtgg gggagcctgg aaaacctctt ccagcacctg 660 

174 gaccaggtga agccctcctt gcgggagaag ctccaggcgg gcatggaggc cctggccctt 720 

176 tcccggaagc tttcccaggt gcacactgac ctgcccctgg aggtggactt cgggaggcgc 780 

178 cgcacaccca acctggaggg tctgcgggct tttttggagc ggttggagtt tggaagcctc 840 

180 ctccacgagt tcggcctcct ggaggggccg aaggcggcag aggaggcccc ctggccccct 900 

182 ccggaagggg cttttttggg cttttccttt tcccgtcccg agcccatgtg ggccgagctt 960 

184 ctggccctgg ctggggcgtg ggaggggcgc ctccatcggg cacaagaccc ccttaggggc 1020 

186 ctgagggacc ttaagggggt gcggggaatc ctggccaagg acctggcggt tttggccctg 1080 

188 cgggagggcc tggacctctt cccagaggac gaccccatgc tcctggccta ccttctggac 1140 

190 ccctccaaca ccacccctga gggggtggcc cggcgttacg ggggggagtg gacggaggat 1200 

192 gcgggggaga gggccctcct ggccgagcgc ctcttccaga ccctaaagga gcgccttaag 1260 

194 ggagaagaac gcctgctttg gctttacgag gaggtggaga agccgctttc ccgggtgttg 1320 

196 gcccggatgg aggccacggg ggtccggctg gacgtggcct acctccaggc cctctccctg 1380 

198 gaggtggagg cggaggtgcg ccagctggag gaggaggtct tccgcctggc cggccacccc 1440

200 ttcaacctca actcccgcga ccagctggag cgggtgctct ttgacgagct gggcctgcct 1500 

202 gccatcggca agacggagaa gacggggaaa cgctccacca gcgctgccgt gctggaggcc 1560 

204 ctgcgagagg cccaccccat cgtggaccgc atcctgcagt accgggagct caccaagctc 1620 

206 aagaacacct acatagaccc cctgcccgcc ctggtccacc ccaagaccgg ccggctccac 1680

208 acccgcttca accagacggc caccgccacg ggcaggcttt ccagctccga ccccaacctg 1740 

210 cagaacatcc ccgtgcgcac ccctctgggc cagcgcatcc gccgagcctt cgtggccgag 1800 

212 gagggctggg tgctggtggt cttggactac agccagattg agcttcgggt cctggcccac 1860 

214 ctctccgggg acgagaacct gatccgggtc tttcaggagg ggagggacat ccacacccag 1920 

216 accgccagct ggatgttcgg cgtttccccc gaaggggtag accctctgat gcgccgggcg 1980 

218 gccaagacca tcaacttcgg ggtgctctac ggcatgtccg cccaccgcct ctccggggag 2040 

220 ctttccatcc cctacgagga ggcggtggcc ttcattgagc gctacttcca gagctacccc 2100 

222 aaggtgcggg cctggattga ggggaccctc gaggagggcc gccggcgggg gtatgtggag 2160 

224 accctcttcg gccgccggcg ctatgtgccc gacctcaacg cccgggtgaa gagcgtgcgc 2220 

226 gaggcggcgg agcgcatggc cttcaacatg ccggtccagg gcaccgccgc cgacctcatg 2280 

228 aagctggcca tggtgcggct tttcccccgg cttcaggaac tgggggcgag gatgcttttg 2340

230 caggtgcacg acgagctggt cctcgaggcc cccaaggacc gggcggagag ggtagccgct 2400 

232 ttggccaagg aggtcatgga gggggtctgg cccctgcagg tgcccctgga ggtggaggtg 2460 

234 ggcctggggg aggactggct ctccgccaag gagtag 2496 

237 <210> SEQ ID NO: 3 

238 <211> LENGTH: 2504 

239 <212> TYPE: DNA 

240 <213> ORGANISM: Artificial Sequence 

242 <220> FEATURE: 

243 <223> OTHER INFORMATION: Synthetic 
245 <400> SEQUENCE: 3

246 atggaggcga tgcttccgct ctttgaaccc aaaggccggg tcctcctggt ggacggccac 60 

248 cacctggcct accgcacctt cttcgccctg aagggcctca ccacgagccg gggcgaaccg 120 

250 gtgcaggcgg tctacggctt cgccaagagc ctcctcaagg ccctgaagga ggacgggtac 180 

252 aaggccgtct tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

254 gcctacaagg cggggagggc cccgaccccc gaggacttcc cccggcagct cgccctcatc 300 

256 aaggagctgg tggacctcct ggggtttacc cgcctcgagg tccccggcta cgaggcggac 360

258 gacgttctcg ccaccctggc caagaaggcg gaaaaggagg ggtacgaggt gcgcatcctc 420

260 accgccgacc gcgacctcta ccaactcgtc tccgaccgcg tcgccgtcct ccaccccgag 480 

262 ggccacctca tcaccccgga gtggctttgg gagaagtacg gcctcaggcc ggagcagtgg 540 

264 gtggacttcc gcgccctcgt gggggacccc tccgacaacc tccccggggt caagggcatc 600 

266 ggggagaaga ccgccctcaa gctcctcaag gagtggggaa gcctggaaaa cctcctcaag 660 

268 aacctggacc gggtaaagcc agaaaacgtc cgggagaaga tcaaggccca cctggaagac 720 

270 ctcaggctct ccttggagct ctcccgggtg cgcaccgacc tccccctgga ggtggacctc 780 

272 gcccaggggc gggagcccga ccgggagggg cttagggcct tcctggagag gctggagttc 840 

274 ggcagcctcc tccacgagtt cggcctcctg gaggcccccg cccccctgga ggaggccccc 900 

276 tggcccccgc cggaaggggc cttcgtgggc ttcgtcctct cccgccccga gcccatgtgg 960 

278 gcggagctta aagccctggc cgcctgcagg gacggccggg tgcaccgggc agcagacccc 1020 

280 ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

282 ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

284 ctcctggacc cctccaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 1200 

286 acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

288 cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

290 cgggtcctgg cccacatgga ggccaccggg gtacggctgg acgtggccta ccttcaggcc 1380 

292 ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 1440 

294 ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

296 aggcttcccg ccttggggaa gacgcaaaag acaggcaagc gctccaccag cgccgcggtg 1560 

298 ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 

300 accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

302 cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 

304 cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

306 gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

308 ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

310 cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

312 cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

314 tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagaggc tacttccaaa 2100 

316 gcttccccaa ggtgcgggcc tggatagaaa agaccctgga ggaggggagg aagcggggct 2160 

318 acgtggaaac cctcttcgga agaaggcgct acgtgcccga cctcaacgcc cgggtgaaga 2220 

320 gcgtcaggga ggccgcggag cgcatggcct tcaacatgcc cgtccagggc accgccgccg 2280 

322 acctcatgaa gctcgccatg gtgaagctct tcccccgcct ccgggagatg ggggcccgca 2340

324 tgctcctcca ggtccacgac gagctcctcc tggaggcccc ccaagcgcgg gccgaggagg 2400 

326 tggcggcttt ggccaaggag gccatggaga aggcctatcc cctcgccgtg cccctggagg 2460 

328 tggaggtggg gatgggggag gactggcttt ccgccaaggg ttag 2504 

331 <210> SEQ ID NO: 4 

332 <211> LENGTH: 832 

333 <212> TYPE: PRT 

334 <213> ORGANISM: Artificial Sequence 

336 <220> FEATURE:

337 <223> OTHER INFORMATION: Synthetic
339 <400> SEQUENCE: 4

341 Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu

342 1 5 10 15

344 Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly

345 20 25 30

347 Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala

348 35 40 45

350 Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val
Glu

360 100 105 110

362 Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys

363 115 120 125

365 Ala Glu Lys Glu Gly Tyr Glu Val Arg Ile Leu Thr Ala Asp Lys Asp

366 130 135 140

368 Leu Tyr Gln Leu Leu Ser Asp Arg Ile His Val Leu His Pro Glu Gly

369 145 150 155 160

371 Tyr Leu Ile Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro

372 165 170 175

374 Asp Gln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn

375 180 185 190

377 Leu Pro Gly Val Lys Gly Ile Gly Glu Lys Thr Ala Arg Lys Leu Leu

378 195 200 205

380 Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu

381 210 215 220

383 Lys Pro Ala Ile Arg Glu Lys Ile Leu Ala His Met Asp Asp Leu Lys

384 225 230 235 240

386 Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val

387 245 250 255

389 Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe

390 260 265 270

392 Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu

393 275 280 285

395 Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly

396 290 295 300

398 Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp

399 305 310 315 320

401 Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro

402 325 330 335

404 Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu

405 340 345 350

407 Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro

408 355 360 365

410 Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn

411 370 375 380

413 Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu

414 385 390 395 400

416 Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu

417 405 410 415

419 Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu

420 420 425 430

422 Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly

423 435 440 445
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1105 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1109 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:1111 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:7
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1317 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:[illegible] M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1362 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1368 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1371 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1374 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1383 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1386 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1410 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1419 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1452 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1455 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1461 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:1464 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:8
L:2569 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:2569 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:2918 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:2918 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:3375 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:3375 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:3820 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:3820 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:4161 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:4161 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:4378 M:258 W: Mandatory Feature missing, <220> FEATURE:
L:4378 M:258 W: Mandatory Feature missing, <223> OTHER INFORMATION:
L:4807 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:99
L:4831 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:100
L:4849 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:101
L:4867 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:102
L:7058 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:170
L:7094 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:172
L:7130 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:174
L:7148 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:175
L:7184 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:177
L:7227 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:179
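
The log entries above follow a regular `L:… M:… W: …` shape, so they can be tallied by message code to see which sequences need attention. The sketch below is illustrative only: it assumes that `L` is a line number in the raw listing, `M` a message code, and `W` a severity flag, which the printout itself does not state, and the names `ENTRY`, `SEQ_ID`, and `summarize` are hypothetical.

```python
import re
from collections import defaultdict

# Assumed entry shape: "L:<line> M:<code> W: <message text>"
ENTRY = re.compile(
    r"^L:(?P<line>\d+)\s+M:(?P<code>\d+)\s+(?P<level>[A-Z]):\s*(?P<text>.*)$"
)
SEQ_ID = re.compile(r"SEQ ID#:\s*(\d+)")


def summarize(log_text):
    """Group log entries by message code and collect any SEQ ID numbers."""
    summary = defaultdict(lambda: {"lines": [], "seq_ids": set()})
    for raw in log_text.splitlines():
        match = ENTRY.match(raw.strip())
        if match is None:
            continue  # skip anything that does not look like a log entry
        bucket = summary[int(match["code"])]
        bucket["lines"].append(int(match["line"]))
        found = SEQ_ID.search(match["text"])
        if found:
            bucket["seq_ids"].add(int(found.group(1)))
    return dict(summary)
```

Calling `summarize` on the text of the log returns one bucket per message code, each holding the listing line numbers and the set of affected SEQ ID numbers; lines that do not match the assumed pattern are ignored.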
09/713,601


Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
290 295 300 

ctt ctg gaa agc ccc aag gcc gca ctc gag cac cac cac cac cac cac 960
Leu Leu Glu Ser Pro Lys Ala Ala Leu Glu His His His His His His
305 310 315 320

tga 963


<210> 61
<211> 320
<212> PRT

<213> Artificial Sequence
<400> 61

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Ile Asn Ser Gly
1 5 10 15


Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 
20 25 30 


His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr 
35 40 45 


Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe Ala Lys Ser Leu
50 55 60


Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile Val Val Phe Asp
65 70 75 80 


ccc gtc cag ggc acc gcc gcc gac ctc atg aag ctg gct atg gtg aag 2304
Pro Val Gln Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys
755 760 765

ctc ttc ccc agg ctg gag gaa atg ggg gcc agg atg ctc ctt cag gtc 2352
Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gln Val
770 775 780

cac aac gag ctg gtc ctc gag gcc cca aaa gag agg gcg gag gcc gtg 2400
His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val
785 790 795 800

gcc cgg ctg gcc aag gag gtc atg gag ggg gtg tat ccc ctg gcc gtg 2448
Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val
805 810 815

ccc ctg gag gtg gag gtg ggg ata ggg gag gac tgg ctc tcc gcc aag 2496
Pro Leu Glu Val Glu Val Gly Ile Gly Glu Asp Trp Leu Ser Ala Lys
820 825 830 

gag tgatag 2505 
Glu 


<210> 66 
<211> 833 
<212> PRT 

<213> Artificial Sequence
<400> 66

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1 5 10 15


<210> 69 
<211> 833 
<212> PRT 
<213> Artificial Sequence

<400> 69

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1 5 10 15


Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
20 25 30


Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
35 40 45


Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
50 55 60


Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65 70 75 80


Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95


Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
100 105 110 


<210> 71 
<211> 833 
<212> PRT 

<213> Artificial Sequence 
<400> 71 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu
1 5 10 15


Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys
20 25 30


Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gln Ala Val Tyr Gly Phe
35 40 45


Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Ile
50 55 60


Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly
65 70 75 80


Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gln
85 90 95


Leu Ala Leu Ile Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu
100 105 110 


290 295 300 

gtt gat aaa ctc tat aac tta att gca aac aaa act aag caa aaa aca 960
Val Asp Lys Leu Tyr Asn Leu Ile Ala Asn Lys Thr Lys Gln Lys Thr
305 310 315 320 

tta gat gca tgg ttt aaa taa 981 
Leu Asp Ala Trp Phe Lys 
325 


<210> 75 

<211> 326 

<212> PRT 

<213> Artificial Sequence

<400> 75

Met Gly Val Gln Phe Gly Asp Phe Ile Pro Lys Asn Ile Ile Ser Phe
1 5 10 15


Glu Asp Leu Lys Gly Lys Lys Val Ala Ile Asp Gly Met Asn Ala Leu
20 25 30


Tyr Gln Phe Leu Thr Ser Ile Arg Leu Arg Asp Gly Ser Pro Leu Arg
35 40 45


Asn Arg Lys Gly Glu Ile Thr Ser Ala Tyr Asn Gly Val Phe Tyr Lys
50 55 60


Thr Ile His Leu Leu Glu Asn Asp Ile Thr Pro Ile Trp Val Phe Asp
65 70 75 80 


